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ABSTRACT 

The stellar kinematics of galactic disks are key to constraining disk formation and evolution pro¬ 
cesses. In this paper, for the first time, we measure the stellar age-velocity dispersion correlation in 
the inner 20 kpc (~ 3.5 disk scale lengths) of M31 and show that it is dramatically different from that 
in the Milky Way. We use optical Hubble Space Telescope/Advanced Camera for Surveys photome¬ 
try of 5800 individual stars from the Panchromatic Hubble Andromeda Treasury (PHAT) survey and 
Keck/DEIMOS radial velocity measurements of the same stars from the Spectroscopic and Photomet¬ 
ric Landscape of Andromeda’s Stellar Halo (SPLASH) survey. We show that the average line-of-sight 
velocity dispersion is a steadily increasing function of stellar age exterior to i? = 10 kpc, increas¬ 
ing from 30 km s“® for the main sequence stars to 90 km s“® for the red giant branch stars. This 
monotonic increase implies that a continuous or recurring process contributed to the evolution of the 
disk. Both the slope and normalization of the dispersion vs. age relation are significantly larger than 
in the Milky Way, allowing for the possibility that the disk of M31 has had a more violent history 
than the disk of the Milky Way, more in line with ACDM predictions. We also find evidence for an 
inhomogeneous distribution of stars from a second kinematical component in addition to the dominant 
disk component. One of the largest and hottest high-dispersion patches is present in all age bins, and 
may be the signature of the end of the long bar. 

Subject headings: galaxies: spiral — galaxies: kinematics and dynamics — galaxies: individual (M31) 

— galaxies: stellar content 


1. INTRODUCTION 

Most late-type galaxies have multiple disk populations 
with distinct scale heights. The vertical surface bright¬ 
ness profiles of e dge-on spirals are well fit by d ouble expo¬ 
nential profiles (Yoachim & Dalcanton 2006), while dy¬ 
namically hot “thick disk” popu lations have b een found 
in both t he Milky Way (e.g ., Chiba 2000) and An¬ 


dromeda (ICollins et al.j|2011|) galaxies. While it is un- 
itne 


clear whether galactic disks typically contain two distinct 
components or a con tinuum of progress ively thicker pop¬ 
ulations as argued in||Bovy et al. (2012), the structure of 


the stellar disk is an important key to understanding a 
galaxy’s formation history. 

Thick disks are thought to have formed through some 
combination of the following three processes. First, stars 
can be formed in a cold, thin dis k and later be dynami- 
cally heated by satellite impacts ( Quinn et al.||1993[ Ve- 


lazquez &: White|[l999[ [Purcell et al.||2010| I'l'issera et al. 

2013p or by internal perturbers such as spiral arms, bars. 
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or scattering by giant molecular clouds ( Ida et al.|1993 ). 
Second, they can be formed “in situ,” from a thick, 
clumpy gas disk at high redshift whose remnants collapse 
further over time to form progre ssively younger, thin¬ 
ner, more me tal-rich stellar disks (Bournaud et al.||2009 


Forbes et al.||2012). Third, the dynamically hot popula¬ 


tion can be a ccrefed from satelli te galaxies through tidal 
interactions (Abadi et al. 2003). Each scenario should 
produce a difterent relationship between age, metallic- 
ity, and degree of heating. For example, accretion of 
metal-poor satellites onto a thin disk would create a bi¬ 
nary disk structure with thin and thick components, each 
with a distinct vertical scale height, age distribution and 
metallicity distribution. In contrast, a continuous pro¬ 
cess such as collapse of a clumpy gas disk or heating 
from frequent low-mass satellite impacts would produce 
disk layers whose thickness (degree of heating) increases 
with age. 

In nearby, low-mass, edge-on spiral galaxies, stellar 
populations’ vertical scale heights increase with age over 
three age bins, suggesting that a continuous process such 
as disk heating plays a r ole in the evolution of those 
galaxies (Seth et al. 2005). An alternative heating diag¬ 
nostic to scale height is velocity dispersion (cr„). Measur¬ 
ing as a function of age can yield even stronger phys¬ 
ical constraints on possible disk evolution mechanisms. 
Dispersion measurements are also less sensitive to dust 
than are scale height measurements, and are possible in 
galaxies that are not perfectly edge-on. 

However, meaningful kinematical measurements are 
difficult to make. In distant galaxies, kinematics derived 
from integrated-light spectra cannot differentiate be¬ 
tween the contributions from old, intermediate-age, and 
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young stellar populations. In particular, near-infrared 
light from old red giant branch (RGB) populations is con¬ 
taminated by flux from younger asymptotic giant branch 
(AGB) stars. In the Milky Way itself, we can more easily 
separate stars by age. Velocity dispersion appears to in- 
crease monotonically wi th age in the solar neighborhood 
(Nordstrom et al. 20041, but the rest of the disk is ob- 
scured to the extent that it is impossible to tell if the solar 
neighborhood is representative of the entire disk, much 
less to trace large-scale kinematical structure across the 
Galaxy. 

Additionally, a study of only one disk galaxy (for exam¬ 
ple, the MW) does not allow us to draw general conclu¬ 
sions about the structure of disk galaxies in general. In 
the case of the MW, this is an important concern. AGDM 
cosmology predicts that galaxies are built up via accre¬ 
tion of smaller satellites. While most collisions between 
host and satellite occur in the halo of the host, a disk 
the mass of the MW’s should have experienced a t least 
one enco unter with a massive (^ 3Mdisk) satellite (Stew- 
art|2008 1. Such a n encounter should significantly thicken 
and heat the disk (Purcell et al.|2009), but the disk of the 
MW does not exhibit any such signs of a cosmologically 
common heating event. Studying the detailed structure 
of a second galaxy disk - like that of Andromeda (M31) - 
can provide an important constraint on whether AGDM 
needs to be revisited or whether the Milky Way is simply 
an outlier in the collision frequency distribution. 

M31 is an ideal candidate for mapping a disk’s veloc¬ 
ity disper sion as a function of age. It is close enough 
(785 kpc; McGonnachie et al. ( 2Ci05| )) that we can iso¬ 
late and niap the velocity dispersions of the RGB, AGB, 
and young upper main sequence (MS) populations sep¬ 
arately, but distant enough that we can see the entire 
disk. We take advantage of data from two surveys of 
the disk-dominated region of M31. The Spectroscopic 
and Photometric Landscape of Andromeda’s Stellar Halo 
(SPLASH) survey has used the Keck/DEIMOS multiob¬ 
ject spectrograph to measure radial velocities of thou¬ 
sands of individual bright stars i n the inner 20 kpc 


3.5 disk scale lengths) of M31 (Gilbert et al. 2009 
Dorman et al. 2012| |2013| Howley et al.| 2012 f! Mean¬ 
while, the recently-completed Panchromatic Hubble An¬ 
dromeda Treasury (PHAT) survey, a Hubble Space Tele¬ 
scope MultiGycle Treasury (HST/MGT) program, has 



MS stars. 

Previously, we analyzed the kinematics of only the 
RGB stars in the intersection of the SPLASH and PHAT 
surveys. We found that 20% of the RGB stars our survey 
belonged to a population wtih spheroid-like kinematics: 
with a velocity dispersi on of 150 km s~^ and Umt./o' "^1/3 


(the ’’inner spheroid”; Dorman et al. (2012)). Later, we 


found that the inner spheroid population has a disk-like 
luminosity function d espite its spheroid-like kinematics 
(Dorman et al. 2013). In the current paper, we expand 
our survey to include three younger PHAT photometry- 
defined age bins in addition to the RGB population, and 
map the line-of-sight (LOS) velocity dispersion of stars 
in each age bin. In this paper, we aim to characterize 
the bulk properties of the disk in each age bin and so do 


not distinguish between “disk” and “spheroid” members. 
However, in §[^ we discuss the evidence for and possible 
origins of subsets of the RGB and AGB populations with 
atypical LOS velocities. 

This paper is organized as follows. First, in §[^ we 
present the dataset composed of Keck/DEIMOS radial 
velocity measurements and HST optical photometry of 
over 8200 individual stars. In §[^ we separate the stars 
into four age bins based on their position in the opti¬ 
cal color-magnitude diagram (CMD) and estimate RGB 
photometric metallicities. We also define our smoothed 
velocity dispersion statistic. We discuss trends and pos¬ 
sible biases in §|^ In §[^ we discuss the constraints our 
results place on disk evolution scenarios. Finally, we 
summarize in §[^ 

2. DATA ACQUISITION AND REDUCTION 

Our dataset is a subset of two larger surveys. Fig¬ 
ure [2 shows the spatial coverage of of both overlaid on 
a GALEX image of M31. We start with radial velocity 
measurements and optical HST photometry of 8265 stars 
in the inner 20 kpc of M31, the region dominated by the 
visible disk. The radial velocities were measured using 
the Keck/DEIMOS multiobject spectrograph between 
2007 and 2012 as part of the SPLASH survey (Gilbert 
et al. 2007 Dorman et al. 2012 |2013 ). The phot ome¬ 
try IS from HS'iyA GS/Wh'G via the PHAT survey (Dal- 
canton et al.||2012|). The filters used were ACS F475W 
and 1*'814W, which, for context, are roughly equivalent 
to Sloan g and Cousins I bands. Here we briefly describe 
the spectroscopi c sample; for information on t he PHAT 
photometry, see Dalcanton et al. (2012) and Williams 


et al. 


/2014). 

This paper co mbines spectroscopic data from several 
small er projects (Gilbert et al.|2007 Dorman et al.|201^ 


2013|. Therefore, the spectroscopic target selection tunc- 

44% of the targets, those 
were selected based 


tion is not homogeneous, 
observed between 2007 and 2010, 
on their apparent degree of isolation in a single-filter i' 
CFHT/MegaCam mosaic image. These targets primarily 
trace the dominant population in M31’s stellar disk: old, 
metal-rich RGB stars. Details on target selection tech- 
niques for these masks can be found in |Dorman et al.| 

([20T^. 

Because these objects are from masks designed prior 
to acquiring PHAT photometry at that location, they 
had to be later matched to their corresponding sources 
in the six-filter PHAT photometric catalogs. First, as¬ 
trometric offsets between each DEIMOS mask and the 
PHAT coordinate system were obtained and applied to 
the spectroscopic sample. Then, for each object in the 
spectroscopic sample, the brightest (in mF 8 i 4 w) PHAT 
star within a search radius of 0."5 was chosen as the 
match. 

The remaining 56% of the spectroscopic targets used 
in this work, observed in 2011 and 2012, were chosen 
based on existing PHAT photometry, eliminating the 
need for post-spectroscopy cross-catalog matching. Since 
we had color information at the target selection stage, 
we were able to prioritize under-represented populations 
over the dominant metal-rich red giants. For the five slit- 
masks, targeting about 1000 sources, observed in 2011, 
we chose red giants across the broad range of photomet¬ 
ric metallicities —2.0 < [M/H] < 0.2. We restricted 
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our sample to stars that were in the magnitude range 
20 < mpsiiw < 22, but otherwise chose stars randomly 
in position and magnitude space. (The quoted magni¬ 
tudes, like al l magnitudes in this p aper, are in the Vega 
system.) See Dorman et al. (2013| for more information 
on the HST-aided spectroscopic target selection in 2011. 

In 2012, we targeted about 4500 more stars across a 
broad range of ages, including young massive MS stars, 
intermediate-age AGB stars, and RGB stars. We also 
target ed a few young clus ters identified by the PHAT 


team ( Johnson et al.|2012 ), although those are not used 
in this work. All ot our targets were brighter than either 
iTiFsiiW = 22 or mFA 75 W = 24. Faint RGB stars (with 
^^ 814 ^ > 21.5) and faint MS stars (with mFsiAW > 21 
or mF 47 bw > 23) were given very low priority and were 
only used on the rare occasion that there was unused 
space to fill on the masks, but otherwise targets were 
chosen randomly within each evolutionary stage. These 
data are presented here for the first time; see Appendix 
A for a full data table. 

After observing, each raw 2D spectrum is collapsed in 
the spatial direction and cross-correlated against a suite 
of template rest-frame spectra to measure its radial ve¬ 
locity. Figure shows some representative spectra and 
zoomed-in views of the absorption lines that dominate 
the radial velocity measurement. Though the entire spec¬ 
trum is used in the cross-correlation, certain absorption 
lines are most important in determining the radial ve¬ 
locity. The Ga II triplet near 8500A is strong in RGB 
and many AGB stars. Temperature-sensitive TiO bands 
across the red side of the spectrum, including a strong 
triplet near 7050A and another band near 8850A, deter¬ 
mine the velocity for many of the redder AGB and RGB 
stars, including those without strong CaT. For some of 
the blue MS stars, Ha and H/? are the only reliable lines 
present, but the Paschen series is also present for young 
supergiants. After the automatic cross-correlation, each 
spectrum is inspected by eye, and only robust velocities 
(those based on at least two strong spectral features) are 
used for kinematical analysis. About 1/3 of the MS stars 
do not pass the quality cut — since hot stars have so few 
spectral features in the optical, it was harder to recover 
a robust velocity. 

The velocity precision varies based on the instrument 
settings that were used. Through 2011, we used the 1200 
line/mm grating on DEIMOS, resulting in an approxi¬ 
mate wavelength range of 6500 — 9100A and a spectral 
resolution of i? = 6000. In 2012, we used the coarser 
600 line/mm grating to gain spectral coverage as blue as 
4500A and better characterize the younger massive up¬ 
per MS stars, yielding a resolution of about R — 2000. In 
both cases, the width of each slitlet was 0."8. The radial 
velocity uncertainties derived from the cross-correlation 
are on the order of a few km s“^ for the higher-resolution 
spectra, and ~ 10 km s“^ for the spectra from 2012. 

The stars in our full spectroscopic sample fall in the 
region of the galaxy dominated by the disk. Based on 


surface brightness profile decompositions (Courteau et al. 
2011 Dorman et al.|20r2), the bulge contributes only 27o 


ot the I band surface brightness at the innermost portion 
of the survey and essentially zero exterior to about 8 kpc. 
Therefore, we do not need to remove bulge stars from the 
sample. 


3. METHODS: VELOCITY DISPERSION AS A FUNCTION 
OF AGE AND OF METALLICITY 

Our goal is to measure the line-of-sight (LOS) velocity 
dispersion of the stellar disk as a function of age and 
of metallicity. In this section, we describe our analysis 
procedure. First, we define regions in the optical CMD 
corresponding to very young MS, younger AGB, older 
AGB, and old RGB populations using a simulated optical 
GMD. Next we split our spectroscopic sample into those 
four bins using the observed optical PHAT CMD. Then, 
we map the line-of-sight dispersion of each component. 
Finally, to look at the old population in more detail, we 
further split the RGB population into two metallicity 
bins and construct a dispersion map for the stars in each 
bin. 


3.1. Definition of age bins 

We first define four age bins in the optical 
F814W/F475W CMD using two criteria: First, we use 
both photometric and spectral discriminants to identify 
stars that are unlikely to be MW foreground (MWFG) 
stars. Second, we use a simple simulated CMD to identify 
regions containing stars of similar ages. We then roughly 
estimate the average ages of our four age bins using the 
simulated CMD and point out that the age estimates of 
the older bins depend significantly on the assumed star 
formation history. 


3.1.1. Foreground contamination 

MWFG dwarfs can lie in the same magnitude window 
as our M31 spectroscopic targets. We took steps, both 
pre- and post-spectroscopy, to eliminate them from the 
catalog. 

For PHAT-selected targets, we avoided likely MW 
members in the target selection stage using UV-IR color- 
color cuts. Here the UV color is ACS F3361F — F4751F 
and the IR color is FllQW — F1601F. A comparison 
of the MW fo reground as simulated by the TRILEGAL 
galaxy model (jVanhollebeke et al.|2009|) and a toy model 
of M31 shows That the foreground dwarfs are exclusively 
bluer in the IR color and redder in the UV color than the 
M31 giant sequence for stars with F4751U — F8141U > 2 
— that is, all of the RGB and AGB stars in our M31 
sample. We use the TRILEGAL MWFG simulation to 
define a box in color-color space to exclude from our spec¬ 
troscopic target selection. 

For the CFHT-selected targets, color information from 
PHAT was not available as a tool for excluding MWFG 
stars at the target selectino stage. Instead, we identified 
red foreground dwarfs based on the presence of the Na 
I doublet at 8190 A, which is an useful giant / dwarf 
discriminant due to to its sensitivity on surface grav¬ 
ity and temperature (Schiavon et al. 1997). We visually 
inspected each spectrum taken between 2007 and 2011, 
and about half of the spectra taken in 2012, for the pres¬ 
ence of the Na I doublet. We also calculated the equiv¬ 
alent width (EW) and the uncertainty on the EW mea¬ 
surement (ctew) across the doublet bandpass (8179-8200 
A) relative to the adjacent continuum (8130-8175A and 


Gilbert et al. 


(2006). The set of stars 


8210-8220A) as in 
with both EW measurements and visual flagging formed 
a training set from which we found a diagnostic that 
can be used to automatically identify definite foreground 
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Figure 1. Spatial coverage of spectroscopic (SPLASH) and photometric (PHAT) surveys from which our data are drawn, overlaid on a 
GALEX UV image of M31 for reference. Magenta regions demarcate the 47 Keck/DEIMOS spectroscopic slitmasks used in the SPLASH 
survey, whereas white rectangles outline the 23 PHAT “bricks” (clusters of HST pointings). In this paper, we use only stars in the 
intersection of these two surveys: those with both PHAT optical photometry and reliable SPLASH-derived radial velocities. 


dwarfs. Figure shows the training set (left) and all 
stars with spectra (right). Stars with EW> 3.2 and sig¬ 
nificance EW/ctew > 8 (those inside the red box) are 
foreground dwarfs and were eliminated from the spectro¬ 
scopic sample. This discriminant was applied to all stars 
in the sample, including PHAT-selected targets that had 
already survived the pre-spectroscopy color-color cut. 

The steps described above do not necessarily eliminate 
every foreground star. To esimate the number of contam¬ 
inants that remain in the sample as a function of optical 
color and magnitude, we employ the TRILEGAL simu¬ 
lation of the Milky Way in the direction of M31. In each 
color/magnitude bin (CM), we calculate the number of 
contaminants Vmw, expected ,cm expected in the SPLASH 
survey: 


VmW, expected,CM = A^SPLASH,CM 


VTrilegal,CM 

A^PHAT,CM 


( 1 ) 


We then subtract the number of contaminants already 
identihed in and removed from that color/magnitude bin 
based on the Na I doublet discriminant: 


A^contaminants,CM — V]v[W,expected,CM Vi.emoved,CM (2) 

Figure shows the resulting distribution of 
Vcontaminants.CM- The number is only significant 
(> 10% of the sample) brighter than mpsidw = 21 


and between 1 < tof 475 w — wpsidw < 2. Stars in this 
portion of the CMD are excluded from the analysis. 
After exclusion, fewer than 0.1% of the stars in the 
sample are expected to be MWFG stars. 


3.1.2. Estimating ages using a simulated CMD 

To estimate the ages of stars as a function of CMD 
position, we simulate a simple stellar population in the 
optical CMD. 

Simulating a CMD requires choosing an age-metallicity 
relation (AMR) and star formation rate (SFR). A few 
measurements from the outer regions of M31’s disk {R ^ 
20 — 30 kpc) using deep HST photometr y are available 
(Brown et al. 2006 Bernard et al. 2012). Both show a 
clear inverse relationship between stellar age and metal- 
licity. For our CMD, we are more interested in separat¬ 
ing the CMD into regions with distinct average ages than 
pinning down those ages precisely. We assume a constant 
SFR of 1 Mq y r~^. This assumption is overly simplistic 
— for example, Bernard et al. (2012) measure a burst of 
star formation in the outer disk at around 2 Gyr ago — 
but is acceptable as we only aim to separate our CMD 
into age bins, rather than measure the precise ages of 
stars in those bins. We estimate the AMR empirically 
from the PHAT RGB da ta in the following w ay: we use 
10 Gyr old PARSEC 1.1 (Bressan et al.|2012) isochrones 
in the metallicity range —2.1 < [M/H\ < 0.3 to esti¬ 
mate the metallicity distribution function (MDF) of all 
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Figure 2. Example spectra from a variety of stellar types. Spectra have been normalized and shifted to rest frame. Top: Representative 
spectra from a young supergiant, young-intermediate age AGB, older and redder AGB, and RGB stars. The two bottom spectra are from 
very red stars, and so their flux on the blue half of the spectrum is very noisy and not shown. Gaps in the spectrum correspond to either 
the DEIMOS chip gap or the atmospheric A-band, whose positions vary from spectrum to spectrum in the rest frame of the star. Bottom: 
Zoom-in views of the portions of the spectrum useful for obtaining velocities. The science spectra are color-coded as in the top panel, 
while template rest-frame spectra used to measure radial velocities via cross-correlation are shown in gray. The wavelengths of Ha and 
the Calcium II triplet are shown in gray vertical lines to aid the eye. (Note that the young (blue) star in the rightmost panel displays the 
Paschen series, not the Calcium triplet, although some of the lines fall at similar wavelengths.) 
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Figure 3. Identifying and removing likely MW foreground dwarfs based on the presence of the surface gravity-sensitive Na I doublet. 
Left: training set based on about 8000 stars that were visually checked for the presence of the doublet. Stars with equivalent widths (EW) 
greater than 3.2 and significance (EW / uncertainty on EW) greater than 8 are were almost universally flagged as dwarfs. Right: Same 
discriminant applied to the full set of spectra (not all of which had been manually inspected). Stars within the red box are almost certainly 
MW dwarfs and were excluded from the analysis. 
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Figure 4. Optical Hess diagram showing the number of fore¬ 
ground Milky Way stars expected in our spectroscopic sample after 
cutting out stars with strong surface gravity-sensitive Nal dou¬ 
blets, which are very likely to be foreground MW dwarfs. The 
blue, black, violet, and red lines outline the MS, youn ger AG B, 
older AGB, and RGB regions described and defined in §|3.1.2| be- 
low. The most-contaminated portion of the CMD - brighter than 
^F814W = 21 and between 1 < mF475W ~ ^F814W < 2 — is ex¬ 
cluded from the analysis. 


of the bright (mF 8 i 4 W < 23) RGB stars in the PHAT 
survey. The AMR is then constructed so as to replicate 
the MDF, assuming a constant star formation rate and a 
metallicity that never decreases with time. For example, 
since there are 7 times as many stars at [M/H] = —0.5 
as at [M/H] = —1, the simulation is allowed to spend 7 
times as long producin g stars with \M/H] = —0.5 as with 
\M/H] = —1. We use|Girardi et al.|(|2010|) isochrones, a 
Kroupa IMF, assume that 357o ot stars are in binaries, 
and apply a constant foreground reddening of Ay = 0.2. 

While this technique ignores the age-metallicity de¬ 
generacy on the RGB, it generates a reasonable GMD. 
The GMD generated using this AMR is very similar to 
one produced usi ng an AMR adapted from the empirical 


one presented in Brown et al. 
HST photometry ot a small 


(|2006|) which used deep 
d m M31’s disk about 


he 

25 kpc from the galactic center. Our CMD assumes a 
constant SFH and thus is not a SFH fit. Due to the age- 
metallicity-extinction degeneracy along the RGB, the as¬ 
sumed SFH may differ dramatically from the true one. 
However, it enables us to select four CMD regions with 
different average ages and measure approximate values 
for those average ages. 

We use the simulated CMD to define four regions con¬ 
taining stars of increasing average age, while avoiding 


the highly contaminated region described in the previ¬ 
ous section. For simplicity, we refer to these bins as 
“MS-k,” “younger AGB,” “older AGB,” and “RGB,” al¬ 
though stars in a given bin do not necessarily all belong 
to the exact same evolutionary stage. The constant-SFR 
CMD and age bin outlines are shown in the left hand 
panel of Figure in the rest of this section, we explain 
the choice of bins. 

All stars bluewards of rnpAibW — mFsiiW = 1 are clas¬ 
sified as “MS-I-” stars. These bright blue stars are pri¬ 
marily massive upper MS members younger than 100 
Myr, although some may be blue supergiant (BSG) 
stars, which have similar ages. The RGB region in¬ 
cludes stars redwards of the line that passes through 
(tof475W — WF814W, wf814w) = (2,23) and (2.7,20.4) 
and fainter than the tip of the red giant branch (TRGB). 
The blue limit is chosen by eye so as to minimize con¬ 
tamination with young MS stars, which overlap the old¬ 
est RGB stars in high-metallicity systems such a s M31 . 
We measure the TRGB using the Bressan et al. (20121 


isochrones described earlier. The brightest RGB stars 
on these isochrones trace the TRGB as a function of 
A4751H - F814VF color. 

We classify most of the rest of the red side of the CMD 
as AGB. To avoid contaminating the AGB bins with 
older RGB stars, we do not use stars less than 0.1 magni¬ 
tude brighter than the TRGB (those within photometric 
scatter of the TRGB). 

Stars of a large range of ages, from a few hundred Myr 
to several Gyr, can lie on the AGB. At a given metallic¬ 
ity, younger AGB isochrones are brighter and bluer than 
older AGB tracks. In the simulated CMD, age roughly 
tracks luminosity, with younger stars towards the top of 
the CMD. We use this age-luminosity dependence to split 
the AGB in half (along the line tof 814 W = 20.5) into two 
age bins: “younger AGB” and “older AGB.” 

The final classification scheme into four age bins is 
shown in the left-hand panel of Figure]^ The MS-1- bin 
is outlined in blue on the blue side of the optical CMD, 
RGB stars in red below the TRGB, older AGB stars in 
purple brightward of the TRGB, and younger AGB stars 
in black. The right hand panel shows the age distribu¬ 
tions of stars in these bins. The age distributions are 
broad and overlap, but have increasing mean ages: The 
MS-(- bin has a mean age of 30 Myr, while the younger 
AGB, older AGB, and RGB stars have average ages of 
0.4, 2, and 4 Gyr, respectively. The RGB bin has a low 
average age for two reasons: First, because we imposed 
an age-metallicity relation, the older, bluer metal-poor 
RGB stars actually overlap in CMD space with young 
red Helium burning stars, while younger, metal-rich RGB 
stars do not suffer from this ambiguity. Therefore we use 
only the younger (redder) portion of the RGB in this 
work. Second, and more importantly, the RGB age dis¬ 
tribution in any magnitude limited sample with a con¬ 
stant SFR is biased towards younger ages since the rate 
of stars moving off the main sequence onto the red giant 
branch is higher for younger stars. 

For comparison, we also generate a CMD with an ex¬ 
ponentially decreasing SFR with timescale t = 4 Gyr. 
This SFR is much steeper (skewed towards older stars ) 
than seen in the outer disk of M31 (|Bernard et al.|2012), 
but gives an interesting boundary case. The boundaries 
of the most reasonable four age bins in CMD space are 
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the same as for the constant-SFR simulation, but the av¬ 
erage ages of the older age bins increase: the older AGB 
has an average age of 3.5 Gyr and the RGB has an av¬ 
erage age of 5.5 Gyr. We do not show this GMD here 
since it does not influence our choice of age bin bound¬ 
aries, but we discuss both sets of age estimates later in 
the paper. 

3 . 2 . Separation of data into age bins 

We divide the stars in the kinematical sample into the 
four age bins defined in the previous section, using optical 
PHAT photometry. 

The left hand panel of Figure]^ shows an optical Hess 
diagram of all stars in the PHAT survey in a representa¬ 
tive region of the galaxy. Spectroscopic targets fall into 
the color/magnitude range outlined in green. The right 
hand panel of Figurej^shows a GMD of the spectroscopic 
sample only, divided into age bins as in Figure 

While our simulated GMD does include reddening from 
the MW foreground, it does not account for differential 
extinction by dust within the disk, and we do not at¬ 
tempt to account for this shortcoming in classifying the 
data into age bins. This means that some of the GMD re¬ 
gions may be contaminated by stars from bluer age bins. 
However, the direction of the reddening vector is such 
that only two bins are likely to be contaminated: A few 
younger AGB stars may be reddened into the older AGB 
region. The RGB region is largely immune from contam¬ 
ination due to reddening, since the shape of the TRGB is 
such that old AGB stars will never be reddened into the 
RGB region, and the other two bins are far enough away 
in GMD space that their members will not be reddened 
onto the RGB either. Additionally, the red helium burn¬ 
ing stars, which are not included in any of our bins, can 
be reddened into the RGB bin; however, these are much 
smaller in number than the RGB stars and thus largely 
insignificant even with reddening. However, the region 
in the GMD occupied by RGB stars contains stars of a 
range of ages, as discussed earlier and pictured in Fig¬ 
ure El 

We estimate the average reddening vector in our sam¬ 
ple using the M31 dust map presented in Dalcanton et al. 
(2014). This map, constructed by comparing the infrared 
colors of RGB stars in the PHAT survey to the unred¬ 
dened RGB, gives the average Ay as a function of loca¬ 
tion across the disk. Only stars that lie behind the dust 
layer are affected. The average reddening vector at the 
locations of our spectroscopic targets, assuming half the 
targets lie behind the dust layer, is shown in Figure]^ 

3 . 3 . Velocity dispersion maps 

We now map the LOS velocity dispersion of stars in 
each of the four age bins using a smoothing technique and 
display the result in two ways: as a 2-D sky map (Fig¬ 
ure and as a 1-D dispersion distribution (Figure [^. 
We ao not fit separately for the radial, azimuthal and 
vertical components of the dispersion. Such a decompo¬ 
sition requires assumptions on the rotation curve and the 
geometry of the disk, and we choose to keep our analy¬ 
sis purely empirical. A future paper will examine the 
shape of the velocity ellipsoid. For reference, because of 
M31’s nearly edge-on inclination, the vertical component 
of the velocity dispersion has negligible contribution to 


the LOS dispersion anywhere on the disk. In general, 
the LOS dispersion is a combination of the radial and 
azimuthal dispersion components, though the azimuthal 
component dominates near the major axis where most of 
the survey field lies. 

For each target, we measure the weighted second mo¬ 
ment of the velocity distribution of all neighbors that 
belong to the same age bin and also fall within some ra¬ 
dius of that target using the maximu m likelihood method 
described in Pryor & Meylan (1993). The weights are the 
inverse square of the velocity measurement uncertainty. 
This technique takes into account random scatter from 
individual velocity measurement uncertainties, and also 
allows a straightforward computation of the uncertainty 
on the velocity dispersion estimate as long as the number 
of points is at least ~ 15 — 20. 

To create 2D dispersion maps, displayed in Figure 
and Figure the smoothing radius is fixed to 200" for 
the MS and RGB bins and 275" for the less densely pop¬ 
ulated AGB bins. Stars with fewer than 15 neighbors are 
dropped from the sample, as their dispersions and asso¬ 
ciated uncertainties are unreliable. These stars are still 
available to serve as “neighbors” for nearby targets, but 
the dispersion measurements centered on them are not 
used. Using smaller smoothing lengths increases spatial 
resolution in densely packed regions of the survey area, 
but results in many points being cut because they do not 
have enough neighbors to yield reliable dispersions and 
uncertainties. Using larger smoothing lengths further re¬ 
duces spatial resolution in the maps, reducing the con¬ 
trast between adjacent high-and low-dispersion patches. 

To create the ID dispersion distributions in Figure 
the smoothing radius is chosen independently for each 
target such that the uncertainty on the velocity disper¬ 
sion is constant within an age bin. A constant uncer¬ 
tainty makes it easier to understand the spread in the 
dispersion distribution due to measurement uncertainty. 
We arbitrarily choose uncertainties of 5 km s“^ for the 
MS-b bin, 7 km s“^ for the RGB stars, and 10 km s“^ 
for the AGB stars. Requiring a constant uncertainty 
means that stars in regions of lower target density or 
with larger individual measurement uncertainties have 
larger smoothing circles. Again, dispersions measured 
using fewer than 15 neighbors are dropped. These uncer¬ 
tainty choices result in an average smoothing circle size 
of about 200" for all age bins. If we allow the dispersion 
uncertainty to be larger, the dispersion distributions are 
smeared out too much to see individual features. If we 
cap the uncertainty at a small value, too many points are 
removed due to small numbers of neighbors. 

It is clear from both the 2D dispersion map in Fig¬ 
ure [T] and the ID dispersion distributions in Figure 
that the typical dispersion increases with average age - 
that is, older populations are dynamically hotter than 
younger populations. In addition, the 2D maps appear 
to be patchy, but we will show in § |4.3| that much of the 
small-scale spatial variation in dispersion is an effect of 
finite sampling and is not physical. 


3 . 4 . RGB velocity dispersion maps as a function of 
metallicity 

We also examine how the RGB dispersion map varies 
with metallicity. 

We estimate the metallicities of the RGB stars by in- 
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Figure 5. Left: Simulated CMD assuming a constant star formation rate. The blue, black, violet, and red outlines define the “MS,” 
“younger AGB,” “older AGB,” and “RGB” age bins, respectively. Stars are color coded by log(age), with brighter (yellower) colors 
corresponding to older ages. Numbers correspond to the average age of stars within each bin. Right: Age distributions for the same four 
regions. Age bin colors are the same as the left panel and throughout this paper. 

We can now measure, for the first time, the age-velocity 
dispersion relation in an external galaxy. Figure [T^ shows 
the average velocity dispersion of stars in our four age 
bins versus their average ages, as estimated from the 
simulated CMDs. The solid black line shows the best fit 
to the points assuming a constant SFR, and the dashed 
black line assuming a decreasing SFR. For comparison, 
we also show the age-dispersion relation for F and G 
dwarfs in the solar neigh borhood of the Milky Way from 


terpolating on the grid of 10 Gyr-old Bressan et al. (20121 
isochrones described in Section 3.1. The resulting metal- 
licity distribution ranges from —2.3 < [M/H] < 0.3 
and is strongly skewed towards high metallicities, with 
a peak at [M/H] = —0.2. We then split the RGB 
sample approximately in half, into a high-metallicity 
{[M/H] > —0.25) and a lower-metallicity bin. The stars 
in our RGB bin are not all 10 Gyr old; the age-metallicity 
degeneracy on the RGB means that the metal-poor bin 
has a slightly higher average age than the metal-rich bin, 
although there is significant overlap in the age distribu¬ 
tions. When we divide the RGB stars in the simulated 
CMD into the same two bins by position in the CMD, the 
low-metallicity bin has an median age of 2.7 Gyr and a 
mean age of 4.9 Gyr, with a broad, relatively flat age dis¬ 
tribution. Meanwhile, the high-metallicity bin is slightly 
younger with a median age of 2.4 Gyr and a mean age of 
3.5 Gyr. For each bin, we construct a smoothed velocity 
dis persi on map and a dispersion histogram as described 
in § |3.3[ The maps are displayed in Figure and the 
ID distributions in Figure El As before, for ease of in¬ 
terpretation, the smoothing circle radius is held constant 
at 200" for the maps, and is allowed to vary in order to 
reach a target dispersion uncertainty of 7 km s“^ for the 
ID distributions. 


4. RESULTS 

4.1. Age-dispersion relation 


Nordstrom et al. (20041. We show both the an and a^ 
components ot tlie Milky Way’s velocity ellipsoid, a^ is 
the best comparison to our major-axis-dominated M31 
data set, but in general the LOS dispersion at any loca¬ 
tion is some combination of the two components (with a 
negligible contribution from the vertical component CTj,). 
The difference in both slope and normalization is strik¬ 
ing. The dispersion of M31 stars increases with age more 
than 3 times faster than the dispersion of MW stars in 
the case of the exponentially decreasing SFR, and five 
times faster in the case of the constant SFR. Addition¬ 
ally, the average dispersion of the RGB bin (90 km s“^) 
is nearly three times as high as the old est, hottest popu¬ 
lation probed in the MW thin disk by [Nordstrom et al.' 
(20041: 10 Gyr stars with a^ ^ 32 km s“^. The MW’s 
metal-poor thick disk is slightly more disturbed, but still 
50% cooler than M 31’s disk, at an = 60 km s“^ (Buden-| 
bener et al.||2014 ). 


This age-dispersion correlation is robust against con¬ 
tamination of our age bins. Two bins — the RGB 
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Figure 6. Left: Optical PHAT CMD from Brick 12, a representative brick near the middle of the PHAT survey area. The green box 
marks the portion of the CMD sampled by the spectroscopic survey. The age bin region outlines are the same as in Figure fm Right: 
Optical PHAT CMD of spectroscopic targets in the PHAT survey region that have optical PHAT photometry and reliable velocities and 
are unlikely to be foreground MW stars based on the strength of their Na I doublet lines. The age bin region outlines are the same as in 
the left hand panel and in Figure^ Stars that fall outside the age bin boundaries have ambiguous ages and/or may be foreground stars, 
are not used in this work. The rea arrow shows the median reddening vector due to dust in M31’s disk, as measured by Dalcanton et al. 
(2014), ass uming that half of the spectroscopic targets lie behind M31’s dust layer. Photometric errors are typically less than 0.01 mag in 
each filter ([Dalcanton et al.|2012|l. 


and older AGB — are probably contaminated by a few 
younger stars. The RGB age bin consists primarily of old 
stars, but also includes some higher-mass, intermediate- 
age stars. This contamination biases the RGB disper¬ 
sion distribution towards that of the faint AGB; in other 
words, an exclusively old population would be at least 
as dynamically hot as our RGB bin. The other af¬ 
fected bin is the older (fainter) AGB group, contami¬ 
nated by younger AGB stars that have been reddened 
by dust in the disk of M31. Again, this contamina¬ 
tion biases the faint AGB dispersion distribution towards 
smaller values, so that the difference in typical disper¬ 
sions between uncontaminated young-intermediate and 
older-intermediate age populations is at least as big as 
the one we report in Figure!^ 

The high dispersion of MSTs stellar disk is also robust 
against “smearing” of the LOS component of the rota¬ 
tion velocity r’DiskLOS within the finite-sized smoothing 
circles. To confirm this, we conduct a test in which we 
map the dispersion in {v — UDiskbOs) instead of just v. 
Here, UDiskLOS is computed assuming that the inclina¬ 
tion i = 77° and major axis P.A= 38° over the entire 
disk, and that the rotation velocity of the disk UDiskRot 
is a constant within each age bin. v^iot is calculated by 
fitting to the deprojected velocity field of stars in each 
age bin, and comes out to (260, 250, 220) km s“^ for the 


(MS-k, AGB, RGB) stars, respectively. The resulting 
dispersion in (u —tiDiskLOs) corresponds to a median cor¬ 
rection of just 0.5% for the RGB dispersion, 1% for the 
AGB dispersion, and 6% for the MS-k dispersion. 

We will discuss the evolutionary implications of the 
age-dispersion relation in more detail in §[^ 

4 . 2 . Age-metallicity relation among RGBs 

Figures [T0| and [TT] show that the metal-poor RGB pop¬ 
ulation is dynamically hotter than the metal-rich popu¬ 
lation, by a factor of almost 50%. The metal-rich compo¬ 
nent has kinematics only about 15% hotter than the older 
AGB population. It is dynamically hot (cr„ ^ 90 km s“^) 
in two spots on the disk. It should be noted that the 
metal-rich bin likely contains some reddened metal-poor 
stars; this may explain the broad dispersion distribution 
of the metal-rich bin. 

4 . 3 . Structure in dispersion map 

The spatial dispersion maps in Figure show that the 
the dispersion of all four age bins varies across the face 
of the disk. 

One of the most obvious features in all four age bins is 
a kinematically hot patch about 6 kpc from the center of 
the galaxy on the major axis. This patch is marked by 
a teal line and outlined in both panels of Figure We 
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Figure 8. RGB dispersion map from Figure[^next to a Herschel image of M31 for reference. A high-dispersion region which we term the 
“Brick 9 Region” is outlined in teal in each panel. Th e Brick 9 region aligns with th e end of the inner ring, also roughly consistent with 
the end of the long bar accordi ng to the simulations by|Athanassoula &; Beaton| (|2006||. The estimated size and orientation of the bar from 
[Athanassoula Sz Beato^ l |2006[ | is marked with a white line. 
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Figure 9. Local line-of-sight velocity dispersion distributions of 
stars in the four age bins for the entire survey area Top and only 
stars in the Brick 9 region outlined in Figure Bottom. For 
this plot, smoothing circle sizes were chosen irmependently for 
each point such that the uncertainty in velocity dispersion was 
7,10, and5 km s“^ for the RGB, AGB, and MS age bins, respec¬ 
tively. Dispersion distributions are constructed from the data using 
a kernel density estimator, with optimum bandwidth chosen using 
Silverman’s method. In the galaxy as a whole (top panel), average 
dispersion increases with average age. The B9 region has signifi¬ 
cantly hotter kinematics than the rest of the galaxy. 


dub the region enclosed by the black box the “Brick 9 
region,” since it is centered on the same area as the set of 
HST pointings known as “Brick 9” in the PHAT survey 
tiling pattern. The two panels in Figure contrast the 
dispersion distributions for the full survey and for the 
brick 9 region. For every age group, the dispersion of 
stars in the Brick 9 region alone is much higher than for 
the overall survey. 

In §[^ we will explore the possible relationship between 
this hot patch and M31’s bar. 

Outside of the Brick 9 region, the dispersion varies 
with position (is “kinematically clumpy”), although the 
dispersions of different populations do not necessarily fol¬ 
low the same spatial pattern. Much of this dumpiness 
is simply a result of low spectroscopic target density, as 
we now show using a modified version of the disk toy 
model described earlier. We construct and analyze the 
toy model for the RGB disk as an example; similar exper¬ 
iments could be run for the younger stellar populations 
as well. 

We construct a toy model of a disk by scattering N 
stars uniformly in a volume 0.8 kpc thick whose base is a 
circle of radius 20 kpc. The velocity vectors of the stars 
are drawn from a 3-dimensional random normal distribu¬ 
tion corresponding to a rotation velocity of 220 km s“^ 
and a velocity dispersion of 90 km s“^ (the average ob¬ 
served RGB dispersion) in each direction. (While it is 
unlikely that the velocity ellipsoid of M31’s disk is actu¬ 
ally isotropic, this choice does not affect the qualitative 
results here.) We choose N such that it most accurately 
reproduces the sampling density of the SPLASH survey. 

We then sample from our toy disk exactly as we do 
in our spectroscopic sample: at the location of each 
SPLASH target, we compute the LOS velocity disper¬ 
sion of all neighbors within a 200" radius. The result¬ 
ing dispersion map, shown in Figure [TSl is kinematically 
clumpy. In Figure |14[ we quantify the dumpiness and 
compare the simulation’s ID dispersion distribution with 
M31’s RGB dispersion distribution. The two distribu¬ 
tions have similar full widths at half maximum. To test 
the effect of sampling density, we also run a simulation 
with a number density five times higher. The resulting 
dispersion distribution is only half as wide as the origi¬ 
nal simulation, corresponding to a less patchy dispersion 
map. 

Why does finite sampling density cause patchiness in 
the dispersion map? With few samples, velocity out¬ 
liers (those on the tails of the velocity distribution) will 
be scarce and will not fall into every smoothing circle. 
Velocity dispersion estimates from different smoothing 
circles may be inflated or deflated based on how many 
velocity outliers they happen to include. The average dis¬ 
persion measured accurately reflects the input (intrinsic) 
velocity distribution (90 km s“^ in this case), but the 
spread in dispersions from star to star is an artifact of 
low sam plin g density. 

Figure [T4| shows that M31’s dispersion distribution has 
large higmand low-dispersion tails in excess of the sim¬ 
ulation — that is, there are dynamically hot and cold 
patches that cannot be explained by finite sampling den¬ 
sity. This excess dumpiness is not a result of patchy 


dust: the spatial distributions of dust (Dalcanton et_al. 
20141 and velocity dispersion are uncorrelated. In Jj[^we 
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Figure 10. Smoothed velocity dispersion s of RGB star s in h igh- (Left) and lower-metallicity [Right) portions of the RGB sample. 
Metallicities are measured using 10 Gyr old |Bressan et al.| l|2012|| isochrones. On average, the metal-rich population has a lower velocity 
dispersion. Some of the hot patches in the hign-metallicity bin may be due to contamination from reddened low-metallicity stars. 


will argue that the dynamically hot patches within each 
age bin are regions where a few stars from a second kine- 
matical component (for example, a bar, halo, or tidal 
debris) are overlaid on the uniform underlying disk. 

5. DISCUSSION 

5.1. High-dispersion population: superposition of two 
kinematical eomponents 

Thus far, we have shown that the second moment of the 
local velocity distribution (LOSVD) — the local line-of- 
sight velocity dispersion — varies substantially between 
age bins and also across the disk of the galaxy, and that 
much of the variation across the disk can be explained 
by finite target density in the spectroscopic survey. Here, 
we show that the shape of the LOSVD within each age 
bin changes with dispersion in a way that cannot be ex¬ 
plained by sampling density alone, but can be explained 
by the presence of a second kinematical component su¬ 
perimposed on a uniform disk. 

If some regions do have intrinsically higher velocity 
dispersions (not caused by sampling effects), it is not 
immediately clear whether the high-dispersion patches 
contain a single dynamically hotter disk component, or 
whether they contain a higher fraction of stars from a 
second kinematical component (such as a spheroid, bar, 
or tidal stream debris). The two scenarios can be dis¬ 
tinguished by the shape of the velocity distribution: If 
we assume that the local LOSVD of the disk is Gaussian 
(which is a reasonable approximation in a galaxy with the 


rotation speed of M3I), the former scenario would pro¬ 
duce a wide symmetric Gaussian distribution, whereas 
the latter would produce a superposition of two distribu¬ 
tions: a narrow central distribution with a wide tail to 
one side. 

Here we test the symmetry of the velocity distribution 
of low-and high-dispersion patches in the RGB age bin. 
We repeat the same process for the other three bins. 

We first split the RGB velocity field into nonoverlap¬ 
ping square pixels, 1 kpc on a side. Then, for each pixel, 
we zero and normalize the velocity distribution by sub¬ 
tracting the mean and scaling by the standard deviation 
of velocities within that pixel: x = {v — i;)/std(r!). We 
now stack all of the scaled velocity distributions x. If the 
local LOSVD were always Gaussian, the stacked distri¬ 
bution would be a unit normal symmetric about a: = 0. 

Figure [T5| compares the scaled, stacked velocity distri¬ 
butions and a unit normal for RGB stars in pixels with 
lower dispersions (less than 100 km s“^; top panel) and 
higher dispersions (bottom panel). The velocity distribu¬ 
tions are skewed to negative velocities, which is towards 
M31’s systemic velocity on this redshifted side of M31’s 
disk. The stacked distribution composed of the low- 
dispersion pixels is nearly symmetric about zero. How¬ 
ever, the high-dispersion velocity distribution is skewed: 
while its mean is at a: = 0 by construction, its mode is at 
a positive velocity and it has a wide tail towards negative 
velocities. The same trend is seen, to a smaller degree, 
in both the AGB bins, but not in the MS-|- bin. 
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Figure 11. Same as Figu re [9] for the RGB stars in the two metal- 
licity bins from Figure [To| The top panel includes all stars, whereas 
the bottom panel only includes stars in the Brick 9 region. The 
metal-rich population is dynamically colder on average, and its 
dispersion is higher on the B9 region than off. 



Figure 12. Comparison of dispersion versus age relations for M31 
and the MW solar neighborhood. Large squares: age-dispersion 
relation for four M31 age bins. Ages are the mean ages from 
the simulated CMD; dispersions are the mean values from Fig¬ 
ure 1^ Large solid squares and large open squares correspond to 
ages measured from the constant SFR and decreasing SFR simu¬ 
lated CMDs, respectively. The solid and dashed lines are the best 
fit to the constant SFR and decreasing SFR data points, respec¬ 
tively. Though this is a very rough comparison, the dispersion 
of a population appears to be a monotonic function of its average 
age. The orange circles and pink squares trace, respectively, the ra¬ 
dial and azimuthal dispersion-ag e profiles for F and G d warfs from 
the Geneva-Copenhagen survey (|N6rdstrom et al.|2004|l. Both the 
slope and normalization of the dispersion versus age relation are 
much higher in M31 than in the MW, regardless of the assumed 
SFR. 


The skew in the high-dispersion pixels is notably ab¬ 
sent when the same analysis is run on the simulated 
toy disk from §4.1. In that case, both the high- 
and low-dispersion pixels have symmetric (that is, one- 
component) LOSVDs. 

In the data, the lower-dispersion pixels always have 
approximately symmetric velocity distributions, sugges¬ 
tion that they are also dominated by stars from a single 
kinematical component and that the median dispersions 
are reliable indicators of the width of the LOSVD of that 
component. For these lower-dispersion pixels, the corre¬ 
lation between average dispersion and age does indeed 
reflect a real trend in the degree of heating of the dom¬ 
inant disk. However, within an age bin, the dispersion 
in dynamically hotter patches are inflated due to a con¬ 
tribution from a few low-velocity stars, rather than to 
a single, dynamically hot disk component. The spatial 
variation in dispersion simply reflects inhomogeneity in 
the number density of this second population. 

The spatially inhomogeneous dynamically hot compo¬ 
nent has been identified before: first, we identified and 
mapped the population of stars whose high velocities rel¬ 
ative to the local disk LOS velocity rendered the m likely 
’’inner spheroid” members (Dorman et al. 2012). Then, 
we found that there are rhore kinematically-identified 
inner spheroid members than can be accounted for by 
the disk-dominated surface brightness profile, and that 
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Figure 13. Velocity dispersion map for a toy model of a disk 
with a Gaussian velocity distribution with = 220 km and 
an = (T<^ = 90 km s“^. 


the spheroid members have a luminosity function nearly 
identical to that of the disk (Dorman et al. 2013). The 
high-dispersion regions identihed in this paper are sim¬ 
ply regions that include some of these ’’inner spheroid” 
stars. The nonuniform spatial distribution of this popu¬ 
lation could easily be produced by localized effects such 
as satellite accretion or disk heating via interactions with 


the ba r or with satellites (as prop osed in Dorman et al . 
(2013)) among other things. As in Dorman et al. 1 2013b 
the hot component is only distinguishable from the rest 
of the disk via kinematics, not photometry: RGB stars 
that form the low-velocity tail have the same distribu¬ 
tion in color-magnitude space as the rest of the RGB 
population. 

The hot kinematical component inflates the smoothed 
velocity dispersion in certain pixels above that of the 
underlying disk, and therefore complica tes the compar¬ 
ison to the Milky Way data in Figure [l^ The Milky 
Way data from the Geneva-Copenhag en survey sample 
only stars within ^ 40 pc of the Sun (Nordstrom et al. 


2004); since the scale height of old st ars in the Galaxy' s 


2012 ), 


thin disk is several hundred pc (e.g., Bovy et al. 
the GCS data set is strongly dominated by disk stars 
and includes only minimal contribution from a spatially 
extended, dynamically hot halo. If the hot population 
in M31 is much more vertically extended than the un¬ 
derlying colder component, then part of the steep slope 
of the M31 age-dispersion relation may be due to ’’con¬ 
tamination” from a halo-like component that wasn’t even 


Figure 14. Velocity dispersion distribution for RGB stars from 
the toy model of a disk with target density comparable to our 
spectroscopic survey (black dashed line), from a toy model with 
target density 5 times higher (blacked dotted line), and from the 
M31 data (red line). Finite sample density can account for most 
of the width of the observed dispersion distribution. 


probed in the MW survey. However, even in this case, the 
qualitative point that MSf’s disk is hotter than the MW’s 
still stands: if we eliminate the likely ’’hot component” 
stars (those on the low-velocity tail of the high-dispersion 
patches in Figure 15), the remaining RGB population 
still has an average dispersion of 80 km s~^, twice as 
high a s in the oldest MW stars in the Nordstrom et ah] 
(2004) sample. 


5.2. Disk evolution scenarios 

In this section, we discuss possible disk formation and 
evolution scenarios’ ability to reprodnce the observed fea¬ 
tures of M31’s stellar disk: a positive correlation between 
age and velocity dispersion, a negative correlation be¬ 
tween RGB metallicity and velocity dispersion, a high 
overall dispersion, and the skewed LOSVD in the high- 
dispersion patches. Dynamically hot populations spa¬ 
tially coincident with a galaxy disk can arise from three 
broad categories of processes: dynamical heating of an 
initially thin disk; in situ formation of a thick disk at 
high redshift; or accretion of satellite debris onto the 
disk. In this section we walk through the three scenar¬ 
ios. We do not attempt to choose the “correct” scenario; 
that will require more detailed modeling. However, we 
show that we can explain all of the observed features 
with a combination of some mechanism to create an age- 
dispersion relation (either thin disk heating or thick disk 
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Figure 15. Comparison of composite velocity distribution shapes 
for RGB stars (solid red curves) to a normal distribution (dashed 
black curves). The top panel includes pixels with dispersion less 
than 100 km s~^, and the bottom panel includes the rest of the 
pixels. By construction, all distributions have the same mean of 
0 and variance of 1. In each panel, the modes of the LOSVD and 
normal distributions are marked by the solid and dotted vertical 
lines, respectively. The velocity distribution of the high disper¬ 
sion pixels (bottom panel) has a significant low-velocity tail that 
is barely present in the low-dispersion distribution. The inflated 
dispersion of the dynamically hot patches in the RGB dispersion 
map, then, is due to a contribution from a few low-velocity stars in 
a second kinematical component (spheroid or tidal debris), rather 
than to an intrinsically hotter underlying disk. Though not shown, 
the same trend is seen in the AGB bins, but not the MS bin or in 
the simulated disk. 

collapse) plus some mechanism to heat the disk nonuni- 
formly (such as accretion of satellites or kicking up of 
existing disk stars via satellite impacts). 


5.2.1. Heated thin disk 

The canonical explanation of thick disk formation in¬ 
volves heating an initially thin disk via perterbations 
from internal structures such as a bar, spiral arm s, or 
giant molecular clouds (GMCs) (e.g., Sellwood 2014 and 
references therein); N-body simulations suggest that im¬ 
pacts from sat ellite galaxies can a l so significantly heat an 
existing disk (fPurcdl et al.||2009 McCarthy et al.||2012 


Tissera et al. ]2U13p. It these heating processes happen 


over an extended period of time, then the heating a stel¬ 
lar population has undergone should correlate with its 
age, consistent with the trend we see in Figuresand 


[T^ It is unclear whether internal heating from CMC 
scattering or spiral arm perturbations could inflate dis¬ 
persions by 60 km s“^ over a period of a several Gyr, but 
stars kicked out of the disk as a result of sa tellite impacts 
can reach dispersions of over 100 km s~^ ( Purc ell et al. 


2010 McCarthy et al.|[2012 Tissera et al.||2uJ5|. 


The young stars m our sample have a velocity disper- 
sion > 50% higher than the young stars in the Milky Way 
(Nordstrom et al.||2004). If disk evolution is entirely due 
to heating of an initially thin disk, then M31’s disk must 
have already been born with a higher dispersion. 

Heating from several discrete satellite impacts can ex¬ 
plain the skewed LOSVD in patches in the dispersion 
map. 


5.2.2. Progressive collapse of a thick gas disk 

An alternative to the heated-thin disk scenario de¬ 
scribed above involves the collapse of an initially thick 
gas disk. At early times, a thick, clumpy disk of gas 
can form a thick disk of stars. The remaining gas later 
collapses into progressively thinne r, dynamically colder 


layers, forming star s along the way (Bournaud et al. 2009, 
Forbes et al.||2012). In this way, younger (more recently 


formed) stars he in a colder, more metal-rich disk than 
older stars. Such a scenario produces a continuous age- 
dispersion correlation. Combined with a small amount 
of heating from satellite encounters and radial migration, 
it can approximately reproduce both the magnitude and 
slope of the age-dispersion relationship i n the disk of the 
Milky Way ( [Bird et al |[M^ |Bird||2015[ ). 

While this disk settling formation mechanism can ex¬ 
plain the continuous nature o f th e age-dispersion correla¬ 
tion in M31 as seen in Figure [T^ it cannot reproduce the 
steepness of the age-dispersion trend (16 km s“^Gyr“^ 
for the constant SFR or 9 for the declining r = 4 Gyr 
SFR) an d the very high velo city dispersion. The fiducial 
model in Forbes et al. (2012|) shows that, at 3 radial scale 
lengths (K ~ 15 kpc in M31), the dispersion in a MW- 
mass galaxy with a decreasing gas accretion rate and 
a standard radial migration prescription increases only 
7 km s“^ Gyr“^ from 0 to 2 Gyr — half that of M31 as¬ 
suming a constant SFR, and still 20% less than in M31 in 
the case of the heavily old star-weighted declining SFR. 

Similarly, additional heating is need to explain the ex¬ 
tremely high velocity dispersion, which, for the RGB 
stars, is 40% of the circular velocity of 250 km s“^ 

(Chemin et al. 2009 Gorbelli et al. 2010|. In contrast, the 
highest stell ar dispersion in models presented in [Forbe^ 
et al. (2012) is 40 km s“^, or < 20% of the circular ve¬ 
locity. 

This scenario also explains the inverse correlation be¬ 
tween metallicity and velocity dispersion for the RGB 
stars, assuming that the more metal-rich RGB popula¬ 
tion is the younger one. 


5.2.3. Satellite accretion 


Cosmological simulations predict that stellar halos are 
built up via hierarchical merging with smaller galaxies, 
and the promine nt substructure in the outskirts of the 


Milky Way (e.g., Mathewson et al. 1974| 

Yanny et al. 

20031 IRocha-Pinto et al.||20031 llbata et al.| 

19941 IMaiew- 

ski et al. 112003 Newberg et al. 2003) an 

M3l ( 

Ibata 

et al. 20041 

20071 b'erguson et al.| 2002 

Kalirail 


Gilbert et a 

. 2001 

McGonnachie et al. 

2009 'I'anaka 
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Dorman et al. 


et al. 12010 Gilbert et al.|20l'2 1 lend support to this idea. 


Because the total area ot the disk is smaller than that 
of the extended halo, satellite-disk interactions should 
be rarer tha n sate llite-halo interactions, but not nonex¬ 
istent. In § |5.2.1[ we discussed how these interactions 
can heat the existing disk. But stars from the satellite 
galaxies can also accrete onto the disk, changing the age 
and metallicity distributions of the disk in addition to 
dynamically heating it. Debris from at least one tidal 
disruption event has been found across the face of M31. 
The Giant Southern Stream, Northeast Shelf, and West- 


1 Ibata et al 

112001 

Kalirai||2006 

Gilbert et al. 12009 

Dor- 

man et al. 

201 !i ) 

Accretion from satellites can natu- 


RGB population. Since the velocity of the satellite would 
likely be quite different than the rotation velocity of the 
disk, regions with accreted stars would end up with in¬ 
flated line-of-sight velocity dispersions, producing local¬ 
ized structure. However, accretion alone cannot account 
for the continuous increase in dispersion with age. It also 
cannot account for the high dispersion of the young MS-b 
stars relative to the young stars in the MW unless M31’s 
young stars were born with a higher dispersion. 


5.3. Brick 9 Feature: Associated with the Bar? 

Many of the high-dispersion measurements lie in the 
“Brick 9 Region” centered at (^,7?) = (4.5,4.5) kpc, 
marked by the ends of the teal line in Figurej^ T his high- 
disper s ion re gion was identihed in RGB s tars in |Dorman| 
et al. (20121 and Dorman et al.| (|2013|) as having the 
highest traction of “inner spheroid” or ‘ kicked-up disk” 
stars. Here, we see that it is not limited to the RGB 
sample. The Brick 9 feature is present in all age bins 
and in both RGB metallicity bins, suggesting that it is 
not an accreted population or formed by an event that 
ended more than a Gyr ago. It is located on the major 
axis of the galaxy, so is not subject to the geometric dis¬ 
persion inflation that happens near the minor axis. It 
does happen to be located on the inner ring (visible in 
the Herschel image in the lower right panel of Figure]^ 
a nd may be coincident with the end of a long bar. 


Athanassoula & Beaton (2006) compared the infrared 
isodensity contours and radial luminosity profiles to N- 
body simulations of barred galaxies to show that M31 
likely has a bar that may extend out to 1300" (5 kpc) at 
a position angle of 45° — placing its NE end just inte¬ 
rior to the Brick 9 feature. The effect of bars on stellar 
kinemat i cs is not well understood, but [Athanassoula 
Beaton (2006) show that the presence of a bar, when 
misalighed with the major axis of its host disk, can re¬ 
sult in a broad velocity distribution near the end of the 
bar. Typically, bar members have different orbits from 
disk members, and so smoothing circles containing both 
bar and disk members would contain a large spread in 
line-of-sight component of stellar velocities. 


5.4. Comparison to Milky Way 

The work presented in this paper shows that M31’s 
stellar disk is dynamically hotter than the MW’s: the 
average dispersion of the RGB bin (90 km s“^) is nearly 
three times as high a s the oldest, hot t est p opulation 
probed in the MW by Nordstrom et al. (2004), and the 


average dispersion of our youngest age bin is more than 
50% higher than that in the Milky Way. In addition, the 
slope of the dispersion versus age relation is more than 3 
times higher in M3I than our own galaxy. Both suggest 
that M31 has had a more violent accretion history than 
the MW in the recent past. 

This result is encouraging for ACDM cosmology, which 
predicts that 70% of disks the size of those in the MW 
and M31 should have interacted with at least one satel- 



possibility that ACDM predictions are correct and that 
the MW is simply an outlier with an unusually quiescent 
history. 

Our results support a growing body of evidence that 
M31 has experienced a more violent merger history than 
our own galaxy. M31’s halo is littered with sub struc- 
ture that can be identifie d both in photometry ([ Iba^ 


et al.| 2014) and kinematics (Chapman 


20081 Gilbert et al. 


2 OI 2 I, and fields at similar radii can host clearly distinct 


stellar populatio ns, indicating that not all stars share a 


comm on origin (Ferguson et al. 2002 


2008). The number ot giant str eams discovered i n M3T 


Richardson et al. 


outnumbers of that of the MW (Ibata et al.||2007 ). Fur¬ 
thermore, the slope of the surface brightness profile of 


slope a ~ 

-2 ([Gilbert et al. 

2012 

Dorman et al. 

2013 

Ibata et al. 

|2(J14p. in comparison, t 

tie surface brigl 

itness 


is much steeper, falling as r~ ^, suggesting a mor e quies¬ 
cent recent accretion history ( Deason et al.|20r4 ). M3I’s 
gas disk is also strongly warped in the outer regions: the 
major axis position angle of the gas disk changes by 10°, 
and the inclination by 7°, between R = 20 and 40 kpc 
( Chemin et al.|[2009 Corbelli et al.||2010| . 

A quantitative comparison between the velocity dis¬ 
persion of M3I’s old stellar disk and cosmological pre¬ 
dictions requires the measurement of the shape of M3I’s 
velocity ellipsoid. We save this measurement for a future 
paper (Dorman et al. 2014b, in preparation) because 
it requires the choice of a particular disk rotation curve 
model. 


5.5. Radial trends 

In Figure we show velocity dispersion as a func¬ 
tion of radius: the median and standard deviation of the 
velocity dispersion in each I kpc wide radial bin in the 
plane of the disk for each age bin. We only include stars 
within ±20° of the major axis, because near the minor 
axis a single smoothing circle over which dispersion is 
calculated may cover a wide range of radii. 

The figure reveals two distinct radial regions in the 
galaxy. Inside 10 kpc, the velocity dispersions of all age 
groups are inflated by a factor of 1.5 —2 from their values 
outside, and the dispersions of the three older age groups 
are similar to each other, especially interior to 8 kpc. The 
dispersions of all age groups decline steeply with radius 
in this inner region. Outside the B9 region, the velocity 
dispersions of the different age groups separate from one 
another, revealing the clean age vs. dispersion relation¬ 
ship, and decline less steeply with radius. The transition 
radius may correspond to a transition in the dominant 
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^^disk (kpc) 


Figure 16. Dispersion as a function of radius in the plane of 
the disk r^isk for each age bin. Age bins are color coded as in 
previous plots. The solid line and shaded region correspond to 
the median and standard deviation of the velocity dispersion in 
each 1 kpc wide radial bin. r^isk is computed assuming a constant 
major axis position angle of 38° and disk inclination of 77°. Only 
stars within ±20° of the major axis are used, since near the minor 
axis distance measurements become more sensitive to precision in 
inclination, and a single smoothing circle may cover a wide range 
of radii. Velocity dispersion decreases with radius in all bins out 
to about 10 kpc. 


heating mechanism: in the outer regions, the clean age- 
dispersion relation implies a continuous heating process 
over the past several Gyr. Inside, a single perturbation 
has modified the orbits of stars older than 100 Myr. This 
could be a single event such as a very recent satellite im¬ 
pact, or a large fraction of the older stars in this region 
could belong to a single, kinematically distinct compo¬ 
nent such as the bar or a thick disk with short scale 
length. 

The 10 kpc break corresponds to a bit less than two ra- 


dial scale lengths 

|Worthev et al. 12005 

Seigar et al.|2008 

Ibata et al.|2005j 

Dorman et al.|2013) 

I'his is consistent 

with the “mass i 

ollows light” radial 

rend seen in stellar 


kinematics in the Dis kMass survey of nearly fa ce-on, late- 
type spiral galaxies (Martinsson et al. 2013): the LOS 
stellar velocity dispersion typically decreases with radius 
out to two scale lengths, after which the data become 
noisy enough that the trend becomes consistent with 
flat. It is worth noting that because the DiskMass sur¬ 
vey galaxies are much closer to face-on than Andromeda 
is (< 30° vs. 77°), the DiskMass LOS dispersion profiles 
are sensitive to the vertical component of the velocity 
dispersion oz while our LOS velocity dispersion has a 
negligible contribution from oz- 


6. SUMMARY 

We have split a spectroscopic sample of 5800 stars from 
the disk of M31 into four age groups based on optical 
HST photometry, and mapped the line-of-sight velocity 
dispersion of each age group. We have found: 


• Stellar velocity dispersion and age are directly cor¬ 
related in the disk of M31. The slope of the best 
fit line assuming a constant SFR is approximately 
16 km s“^ Gyr“^, more than 5 times higher than 


that of the Milky Way. 

• Among RGB stars in our sample, the metal-poor 
half is 50% kinematically hotter than the metal- 
rich half. 

• The stellar disk is kinematically clumpy on scales 
smaller than about 200" (760 pc). Most of the 
structure can be attributed to sampling effects. 
However, the line of sight velocity distribution of 
the RGB and AGB stars in the highest-dispersion 
regions is quite asymmetric, with a low-velocity 
tail. This asymmetry indicates the presence of 
an additional kinematic component that is not 
smoothly distributed across the galaxy. 

• There is a patch on the major axis about 6 kpc from 
the galactic center where the dispersions of all four 
age groups are inflated by a factor of 1.5 — 2. This 
patch may correspond to the end of the long bar. 
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APPENDIX 


This paper presents, for the first time, resolved stellar kinematics derived from Keck/DEIMOS observations from 
Fall 2012. Table [l] lists the slitmasks observed and the number of quality radial velocities recovered from each. 

The target selection was done in a non-uniform way in order to maximize the number of younger (main-sequence, 
asymptotic giant branch, blue and red supergiant) stars relative to the dominant red giant branch population. As 
such, the spectroscopic selection function varies with color, magnitude, and position on the sky, though it can be easily 
recovered by comparing the distribution of targets to the full PHAT catalog. 

The raw data were flat-fielded, locally sky subtracted, c ollapsed into 1-D spectra, and cross-correlated against 


template spectra to measure radial velocities as described in Dorman et al. (2012 2013). 
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Table 1 

Keck/DEIMOS Multiobject Slitmask Exposures from Fall 2012 


Mask 

Name 

Observation 
Date (UT) 

a [J2000] 

(h m s) 

<5 [J2000] 

(0 , 

P.A. 

n 

^exp 

(sec) 

Seeing 

FWHM 

No. of 
Slits 

No. of Usable 
Target Velocities 
(Success Rate) 

No. of Usable 
Velocities of 

Serendipitously Detected 
Stars 

mctOC 

2012 

Sept 

18 

00 

44 

47.17 

41 22 

00.0 

-140.0 

3 X 1020 

0”.6 

225 

184 (82%) 

31 

mct6D 

2012 

Sept 

20 

00 

44 

34.60 

41 29 

44.6 

170.0 

3 X 1200 

0”.8 

208 

167 (80%) 

38 

mct6E 

2012 

Sept 

18 

00 

44 

13.52 

41 19 

05.5 

-20.0 

2 X 1080 

0”.6 

228 

107 (47%) 

16 

mct6F 

2012 

Sept 

19 

00 

45 

54.72 

41 41 

58.6 

-164.0 

3 X 1020 

0”.6 

221 

192 (87%) 

31 

mct6G 

2012 

Sept 

20 

00 

45 

38.34 

41 43 

37.4 

-155.0 

3 X 1200 

0”.9 

231 

178 (77%) 

24 

mct6H 

2012 

Sept 

18 

00 

45 

26.92 

41 44 

04.3 

+15.0 

3 X 1020 

0”.6 

244 

185 (76%) 

24 

mct6I 

2012 

Sept 

19 

00 

44 

10.25 

41 25 

16.2 

-95.0 

3 X 1080 

0”.7 

209 

138 (66%) 

52 

mct6K 

2012 

Sept 

18 

00 

44 

38.26 

41 37 

22.5 

-95.0 

3 X 1020 

0”.7 

207 

155 (75%) 

45 

mct6L 

2012 

Sept 

19 

00 

46 

05.81 

42 02 

28.1 

-20.0 

3 X 1020 

0”.8 

246 

212 (86%) 

8 

mctOM 

2012 

Sept 

20 

00 

44 

36.98 

41 32 

39.5 

-20.0 

3 X 1080 

0”.75 

227 

153 (67%) 

40 

mct60 

2012 

Sept 

20 

00 

45 

08.17 

41 52 

34.0 

-80.0 

2 X 1080 + 1 X 1140 

0” .93 

240 

169 (70%) 

24 

mct6P 

2012 

Sept 

19 

00 

45 

30.50 

41 55 

37.6 

-80.0 

2 X 1019 + 1 X 855 

0”.6 

229 

161 (70%) 

10 

mct6Q 

2012 

Sept 

18 

00 

45 

35.98 

42 00 

17.9 

-80.0 

3 X 1080 

0”.8 

225 

155 (69%) 

21 

mct6R 

2012 

Sept 

19 

00 

47 

02.28 

42 12 

07.3 

-80.0 

3 X 1080 

0”.8 

227 

175 (77%) 

9 

mct6S 

2012 

Sept 

20 

00 

47 

02.27 

42 09 

25.2 

-80.0 

3 X 1200 

0”.8 

224 

163 (73%) 

18 

mct6T 

2012 

Sept 

18 

00 

45 

57.60 

42 01 

12.0 

-40.0 

3 X 1020 

0”.6 

245 

210 (86%) 

10 

mctOU 

2012 

Sept 

20 

00 

46 

13.24 

42 14 

35.6 

+50.0 

3 X 1200 

0”.8 

254 

185 (73%) 

5 

mct6V 

2012 

Sept 

18 

00 

46 

40.80 

42 13 

48.0 

-130.0 

2 X 1080 + 1 X 1020 

0”.6 

226 

126 (56%) 

13 

mct6W 

2012 

Sept 

19 

00 

46 

51.73 

42 12 

46.0 

-130.0 

3 X 1020 

0”.6 

225 

175 (78%) 

8 

mct6X 

2012 

Sept 

19 

00 

46 

22.94 

42 01 

58.0 

140.0 

3 X 1020 

0”.7 

225 

173 (77%) 

15 

Total: 












4566 

3368 (74%) 

442 




